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An assay for testing the carcinogenic properties of a test substance comprising: (i) introducing into cells a reporter gene expression 
vector comprising a repetitive DNA sequence which exhibits instability in cancer cells, whereby instability of the repetitive DNA sequence 
affects expression of the reporter gene; (ii) exposing the resulting cells to the test substance; and (iii) determining whether the test substance 
is carcinogenic or anticarcinogenic by comparing the frequency of reporter gene expression in the resulting cells with the frequency of 
reporter gene expression in cells which have not been exposed to the test substance. The invention also provides expression vectors suitable 
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BIOLOGICAL ASSAY FOR TESTING THE CARCINOGENIC PROPERTIES OF A SUBSTANCE 

The present invention relates to a biological assay and assay reagents for 
testing the carcinogenic properties of a test substance. The assay is useful 
5 for screening potential anti-cancer drugs as well as for testing the 
carcinogenic properties of food components. 

DNA repair is an essential process in all organisms from prokaryotes to 
eukaryotes. Defective DNA repair in higher eukaryotes such as humans is 

10 an important factor in the aetiology of both hereditary and sporadic 
carcinomas. According to the current model of carcinogenesis, initiation 
is a single-cell event which leads to the development of a precancerous 
lesion by clonal expansion. Progression to an invasive tumour is a 
prolonged process requiring the acquisition of several further mutations in 

15 genes controlling cell proliferation and differentiation. As the spontaneous 
mutation rate in normal cells is relatively low it suggests that an early 
event in the pathway of tumourgenesis is a mutation that confers a so- 
called "mutator" phenotype. This postulates that defects in DNA repair 
occur at an early stage in the sequence of events and favour accelerated 

20 tumour progression. This hypothesis is corroborated by the recent link 
between cancer, microsatellite instability and mutations in the genes 
encoding the mismatch repair machinery. Mismatch repair involves the 
processing of incorrectly paired nucleotides which can occur as a result of 
normal DNA metabolism. It also plays an important role in the 

25 recognition and correction of unpaired loop structures which form during 
the replication of highly repetitive regions of DNA known as 
microsatellite DNA. If left unrepaired these loop-structures can lead to 
frame-slippage and potential loss of gene function. 



l 
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Hereditary non-polyposis cancer (HNPCC) is a cancer of the colon 
characterised by microsatellite instability. HNPCC is an autosomal 
dominant disease in which multiple members of a family suffer early onset 
5 colon cancer in the absence of polyp formation. Of the HNPCC tumours 
that exhibit microsatellite instability over 50% contain mutations in the 
HNPCC-linked hMSH2 gene on chromosome 2 and about 20-30% contain 
mutations in the HNPCC-linked hMLHl gen on chromosome 3 (Umar, A. 
&Kunkel, T.A., 1996). 

10 

HNPCC-Lynch syndromes I and II is a common cancer predisposition 
syndrome that is autosomal dominant in nature. Lynch I families suffer 
early onset colorectal cancer, while Lynch II kindreds are also susceptible 
to extra colonic epithelial tumours of the endometrium, ovary, stomach, 

15 small intestine, kidney and ureter. The link between microsatellite 
instability and cancer is demonstrated by the fact that a subset of sporadic 
colon cancers and the majority of tumours occurring in HNPCC patients 
contain frequent mutations in the simple microsatellite sequences (A)n, 
(GGC)n, or (CA)n. These mutations seem to be tumour specific with 

20 each cell containing thousands of microsatellite mutations. Studies 
involving microsatellite instability show that it is present in a significant 
number of sporadic tumours including colorectal (12-28%), endometrial 

(-1-7-23 %-)rstomach-(^ pancreatic 

(67%), oesophageal adenoma (22%), squamous cell skin (50%), and 

25 small-cell lung cancer (45%) (Eshleman, J.R. & Markowitz, S.D. 1995; 
Aaltonen et al. 1993; Merlo et al. 1994; Mironov et al. 1994; Orth et al. 
1994; Modrich, M. & Labue, R. 1996). 
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Although not fully understood, the mismatch repair pathways of both 
lower and higher eukaryotes share extensive homology. For example, 
homologues of the human mismatch repair pathway exist in 
Saccharomyces cerevisiae. 

5 Henderson and Petes, Mol and Cell Biol, June 1992, 12, No. 6, p. 2749- 
2757 have constructed reporter gene expression vectors for studying 
spontaneous frameshift mutations. The vectors are based on a plasmid 
having the LEU2 promoter and the first 12 codons of the yeast LEU2 
protein fused to the eighth codon of the E. coli p-galactosidase gene 

10 (lacZ). They inserted various oligonucleotides containing simple 
repetitive DNAs into the BamYLl site near the beginning of the p- 
galactosidase gene. 

The inserts did not shift the reading frame so that P-galactosidase 
15 expression occurred unless a frameshift mutation occurred in the host 
(yeast) cells. Such events were visible as white colonies when the cells 
were grown on a medium containing Xgal. 

Similar reporter gene expression vectors were constructed in which the 
20 repetitive DNA tract was inserted upstream of the URA3 gene. On a 
medium containing 5-fluoro-orotic acid (5-FOA) frameshift mutations 
were detected as URA3-(5-FOA R ) cell colonies. 

Levinson and Gutman (1987), Nuc. Acids Res., 15, p. 5323-5339 also used 
25 a reporter gene expression vector to study frameshift mutations in the 
prokaryote E. coli K12. The vector was based on bacteriophage Ml 3 and 
contained short poly-CA/TG tandem repeats linked to the lacZ gene which 
encodes P-galactosidase. 
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Strand et al y Nature, 365, p.274-276, 16 September 1993, used a reporter 
gene expression vector comprising a yeast promoter fused to a P- 
galactosidase gene that contained a 29-base pair out-of-frame poly (GT) 
5 tract in the coding sequence to study repetitive DNA tract instability. 
They found that mutations in any three yeast genes involved in DNA 
mismatch repair (PMS1, MLH1 and MSH2) lead to 100- to 700-fold 
increases in repetitive tract instability, whereas mutations that eliminate 
the proof reading function of DNA polymerases have little effect. 

10 

According to a first aspect of the invention there is provided an assay for 
testing the carcinogenic properties of a test substance comprising: (i) 
introducing into cells a reporter gene expression vector comprising a 
repetitive DNA sequence which exhibits instability in cancer cells, 

15 whereby instability of the repetitive DNA sequence affects expression of 
the reporter gene; (ii) exposing the resulting cells to the test substance; 
and (iii) determining whether the test substance is carcinogenic or anti- 
carcinogenic by comparing the frequency of reporter gene expression in 
the resulting cells with the frequency of reporter gene expression in cells 

20 which have not been exposed to the test substance. 

The term "carcinogenic properties" is intended to embrace the ability of 

the-test-substance-to-inhibit^ 

term embraces both carcinogenic and anti-carcinogenic properties. 

25 

By "instability" we mean a change in the size of the DNA sequence, 
normally by additions or deletions that are not a multiple of 3 bp. Such 
changes in size alter the reading frame for transcription of adjacent genes 
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Preferably, the repetitive DNA sequence (often referred to as 
microsatellite DNA) comprises a poly d(AC/TG) tract and/or a poly 
5 d(GT/CA) tract, although the tract may comprise a single nucleotide eg. 
poly d(G) or poly d(A). 

It will be appreciated that the length of the repetitive DNA tract can be 
varied and is preferably selected according to the length of the repetitive 
10 sequence identified as being unstable in the cancer cell of interest. 
However, the length of the repetitive sequence is conveniently 8 to 60 
nucleotides, more preferably 16 to 32 and especially 16. 

The term "reporter gene expression vector" is intended to cover any vector 
15 into which a reporter gene has been inserted so that, on introduction into a 
suitable host cell, the reporter gene will be transcribed and translated to 
produce the protein product of the reporter gene. 

A skilled person will appreciate that the expression vector can be provided 
20 in a variety of forms eg. a plasmid, a 'phage or a virus. 

Preferably the reporter gene expression vector comprises a promoter 
region of a gene which is normally expressed in the host cell fused to a 
sequence encoding a reporter gene product which can be expressed in the 
25 host cell. Preferably a repetitive DNA sequence which exhibits instability 
in cancer cells is inserted downstream of the promoter region into the 
open-reading frame of the reporter gene sequence. The open-reading 
frame (ORF) will be understood by skilled persons to mean a DNA 
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sequence which contains a series of triplets coding for amino acids without 
any termination codons. 



The insertion preferably "knocks" the promoter/reporter gene fusion out- 
of-frame (+1 or -1 reading frame) so that the reporter gene is not 
expressed. Hence expression of the reporter gene only occurs if the 
inserted repetitive sequence changes size (exhibits instability) so that the 
correct reading frame of the reporter gene is established. 

Alternatively, the insertion does not knock the promoter/reporter gene 
fusion out of frame so that expression of the reporter gene occurs unless 
the inserted repetitive sequence exhibits instability so that it knocks the 
promoter/reporter gene out-of-frame. 

It is within the knowledge of skilled persons to select a variety of 
promoter sequences and reporter gene sequences which can be used in a 
given host cell be it yeast, human or bacterial. The reporter gene 
expression vectors disclosed by Henderson and Petes (1992), Levinson 
and Gutman (1987), and Strand et al (1993) are incorporated herein by 
reference. 



Preferably the reporter gene expression vector is provided in the form oi 
low-or-high-copy-numb-e^lasmid, or an integrative plasmid, that is, 
plasmid which lacks a host cell origin of replication and must therefore ! 
integrated into the host cell genome for stable maintenance in the ho 
cells. 



Preferably the reporter gene comprises a gene whose expression product 
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gives rise to a visible change in the host cell. For example the gene 
product may produce a colour change or fluorescence. A particularly 
preferred reporter gene system comprises the lacZ gene which encodes the 
enzyme p-galactosidase. p-galactosidase expression can be detected as a 
5 blue colour in colonies growth on a medium containing Xgal. Colonies 
which do not express P-galactosidase appear as white colonies. Hence, 
use of the vector comprising the lacZ reporter gene according to a 
preferred embodiment provides a simple blue/white colour test for 
screening the carcinogenic effect of a test substance. 

10 

Preferably the cells used in the assay are eukaryotic cells, preferably yeast 
cells or human cells, and especially eukaryotic cells which have a defect in 
repetitive DNA instability repair mechanisms, especially the mismatch 
repair pathway. As homologues of the human mismatch repair pathway 
15 exist in yeast such as Saccharomyces cerevisiae this single cell eukaryote 
provides an ideal model for studying the effects of test substances eg. 
dietary constituents on DNA repeat instability in humans. 

Of course human cell lines can be used directly in the assays of the 
20 invention, the cell lines being derived from the cancer of interest eg. 
human colorectal cancer, especially hereditary non-polyposis cancer 
(HNPCC). Although not preferred, the assay may use prokaryotic cells, 
conveniently bacterial cells such as Escherichia coli. 

25 In a second aspect the invention provides an assay comprising testing the 
carcinogenic properties of a test substance using yeast cells according to 
the first aspect of the invention; and further testing the test substance using 
human cells according to the first aspect of the invention. 
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As yeast cells are easier to work with than human cells the first step of the 
assay according to the second aspect of the invention provides a primary 
screening step which can be used to screen out test substances exhibiting 
undesirable carcinogenic properties before the second step whose results 
have greater significance in humans. 

In a third aspect the invention provides an assay for testing the 
carcinogenic properties of a test substance comprising the steps of: (i) 
screening a test substance using a reporter gene expression vector 
according to the first aspect of the invention; and (ii) repeating the assay 
using cells containing a high, medium, low or single copy number vector, 
the high, medium, low or single copy number vector being selected 
depending on the frequency of repetitive DNA instability measured in step 
(i). 

Preferably the reporter gene expression vector in step (i) is contained in 
the cells at a high copy number. 



The assay according to the third aspect of the invention is particularly 
useful when the frequency of instability in step (i) is very high in cells 
which have not been exposed to the test substance. By selecting medium, 
low or single copy number vectors the background instability can be 
reduced-as-desi3red"tonSke"the assay more sensitive! 



Preferred non-limiting embodiments of the invention will now be 
described with reference to the accompanying drawings in which: 

Figure 1 shows the steps of a preferred assay according to the first 
aspect of the invention. The assay utilises a preferred assay plasmid of the 
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invention; 

Figure 1A shows the high copy assay plasmid pKa3-9(n) as 
previously described. The unique EcoRl site into which the respective 
tracts were cloned is underlined. Also underlined is the unique Stul site 
5 situated within the URA3 gene, used for the insertion of the KanMX4 
cassette (cf. text for further details). 

Figure IB shows the high copy assay plasmid pKa3-9(n)KanMX4 
utilising the dominant selectable marker KanMX4. Both the EcoRI site 
into which the respective tracts were cloned and the additional EcoRI site 
10 derived from the KanMX4 cloning step are underlined (cf. text for further 
details). 

Figure 2 shows a yeast/£. coli shuttle vector suitable for fusing 
yeast promoter and coding sequences to the lacZ gene of E. coli\ 

Figure 2A shows the centromere-based assay plasmid pKaCEN(n) 
15 as previously described. The unique EcoRI site into which the repetitive 
tracts were cloned is underlined. Also underlined is the unique Stul site 
situated within the URA3 gene, used for the insertion of the KanMX4 
cassette (cf. text for further details). 

Figure 2B shows the centromere-based assay plasmid 
20 pKaCEN(n)KanMX4 utilising the dominant selectable marker KanMX4. 
Both the EcoRI site into which the repetitive tracts were cloned and the 
additional EcoRI site derived from the KanMX4 cloning step are 
underlined (cf. text for further details). 

Figure 3 shows a low copy number vector containing sequences 
25 from yeast centromere VI; 

Figure 4 shows a vector which lacks a yeast origin of replication, 
so that it must be integrated into the yeast genome for stable maintenance; 

Figure 5 shows a preferred high copy number reporter gene 
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expression vector according to the invention; 

Figure 6 shows a preferred low copy number reporter gene 
expression vector according to the invention; 

Figure 7 shows a preferred single copy integrative reporter gene 
expression vector according to the invention; 

Figure 8 is a schematic representation which shows integration of 
the vector of Figure 7 at the URA3 locus of S. cerevisiae strain YN94-1; 

Figures 9 and 10 show the results of experiments to confirm 
insertion of the integrative vector of Figure 7 into the S. cerevisiae strain 
as illustrated in Figure 8; 

Figure 11 shows the synthetic pathway for the polyamines 
putrescine, spermidine and spermine in eukaryotes; 

Figure 12 is a schematic representation of frame-slippage in the 
preferred vectors leading to P-galactosidase reporter gene expression; 

Figures 13a and 13b shows a Southern blot analysis of MSH2 
genomic DNA (Figure 13a) and the disruption scheme (Figure 13b); 

Figure 14 shows a Southern blot analysis of wild type and disrupted 
MLH1 gene; 

Figure 15 shows a vector incorporating the SV40 promoter 
upstream of the luciferase gene; 

Figures 15a and 15b show Southern blot analysis of YN97-150 
(msh :: KanMX4); 



Figure 16 shows an EBV-based vector for stable expression of 
DNA in human host cells; 

Figures 16a and 16b show Southern blot analysis of YN97-167 
(rthl :: KanMX4); 

Figure 17 shows the effect of a known carcinogen on the instability 
of the high copy number expression vector of Example 1(1). 



10 



WO 98/45476 PCT/GB98/00869 

Figures 17a and 17b show Southern blot analysis of YN98-3 (pol 
130-104 ::LEU2)' y 

Figure 18 is a schematic outlining biosynthesis of purine 
nucleotides, from Jones and Fink, 1982. 
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MATERIALS 



1 EBEEEBBEB yeast STRAINS 
5 YN94-1: ade2-l, his3-ll, leu2-3, 112, trpl-1, ura3~l, can! -100. 

2. PR EFERRED R COLT STRAINS 

DH5a: F,_80/acZAM15 A(/<zcZYA-argF)U169 deoR recAl endAl 
hdsR17(r K -, m K +) phoA supEU y- thi-1 gyr A96 re/Al. 

el4-(McrA-) recAl ew/Al gyrA96 thi-1 hdsRll (r K -m K +) 
supEM relAX A(lac-proAB) [F' mzD36 proAB 
laclqZ*Ml5]. 

F-, hsdS20 (r B -, m B -), rapE44, aral4, /wA2, /psL20 (str), 
syl-5, mlt-5, supEAA, y-. 

3 PREFERRED MF.nTA 
SCD minimal/defined medium 



JM109: 



15 

RR1: 



20g glucose 

1 .7g yeast nitrogen based - without amino acids and ammonium sulphate 

25 (Difco, Detroit, USA) 

5g ammonium sulphate 

25ml "drop-out" mix 

20g agar (if plates are required) 

12 
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Made up to 1 litre with glass dist. H 2 0 and autoclaved at 121 °C for 15 
minutes. 



10 



"Drop-out" mix (40x): 



0.2g 
0.2g 
O.lg 
0.6g 



uracil 
arginine 
histidine 
isoleucine 



0.4g lysine O.lg adenine 

O.lg methionine 0.4g tyrosine 

0.6g phenylalanine 0.4g tryptophan 

0.5g threonine 0.6g leucine 



"Drop-out mix" lacking uracil (SCD-U) was used for selection of assay 
plasmids. Food components at various concentrations were added 
aseptically before the plates were poured. 



15 LB (Luria-Bertani) Medium (for growth of E. coli) 



lOg Bacto-tryptone (Difco) 

5g Bacto-yeast extract (Difco) 

lOg NaCl 

20 15g Agar (if plates are required) 



Make up to 1 litre with glass dist. H 2 0, Adjust pH to 7.5 with NaOH and 
autoclave at 121 °C for 15 minutes. 

25 For Amp R selection, ampicillin is added to a final cone, of 100(j,g/ml when 
the media has cooled to approximately 55 °C. 

4. PREFERRED REPETITIVE OLIGONUCLEOTTPKS TTSFT) 
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IN THE CONSTRUCTION OF ASSAY PT Ammo 
(purchased from Oswel DNA service, Southampton) 



Oligonucleotide 1 



(47 nt, poly d(TG) I6 , with an EcoM restriction site overhang at i 
and a Smal site, indicated by underlining.) 
Concentration. 14 13mg/ml 
T m 0.1 M Na+ 74. TC 

Smal 
1 

5'-AA1IG£££G£KiCTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGCCG-3' 



Oligonucleotide 2 



(47 nt, poly d(AC) 16 , with an EcoRI restriction site overhang (underlined) 
at its 3' end). 

Concentration 778mg/ml 
T m 0.1 M Na+ 74.1°C 



^-GGGGCGGGAGACACACACACACACACACAC ACACACACAC 
GGCTT AA -3' 

5 PREFERRED PRIMER sfqttfnt^ 



LacZ Reverse: 529 5 ' -A AGGGGGATGTGCTGCAAGG-3 ' 

14 
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YEp Forward: 8271 5 '-GCAGCGAGTCAGTGAGCGAGG-3 8291 

The numbers in subscript indicate the positions of annealing on plasmid 

YEp356R (cf. Fig. 2). 

5 PRS3 Forward: _ 50 5 ' -GATTC ATACTCTTTTTTCTACG -3 ' _ 81 

The numbers in subscript indicate the positions of annealing relative to the 
ATG start codon of PRS3. 

Repeat tracts of varying sequence and length were synthesized and 
10 inserted into the unique EcoTtl site of pSS3-9 as previously described. 

Poly (AC) 15 A assay tract 

(shifts PRS3'/LacZ fusion into +1 reading frame with respect to start 
15 ATG of PRS3) 

Oligonucleotide 1 

(poly d(AC) l5 A, with an EcoRl restiction site overhang at its 5'-end and a 
20 Smal site at its 3 f -end both indicated by underlining, giving a total length 
of 46 nts). 

5'-AATl£GGACACACACACACACACACACACACACACACAGCC£ 
GGG C 3' 

25 

Oligonucleotide 2 



15 
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(poly d(TG) J5 T, with an EcoM restiction site overhang at its 5'-end and a 
Smal site at its 3'-end both indicated by underlining, giving a total length 
of46nts). 



5'-AATTGCCCGfiQCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 
CCG-3' 

Poly (C,. 3 A) telomeric assay tract 

In Saccharomyces cerevisiae the sequence poly (C,. 3 A) is found at or near 
the telomeres (Shampay et al. 1984). To examine the stability of these 
sequences a 65 bp telomeric tract was inserted into the EcoRl site of 
plasmid pSS3-9 as described. 

(insertion shifts PRS3'/LacZ fusion into -1 reading frame with respect to 
start ATG) 



Telomeric oligonucleotide 1 



(poly d(C,. 3 A), with an EcoRl restriction site overhang at its 5'-end and a 
Smal site at its 3'-end both indicated by underlining, giving a total length 
of80nts). 

5 -AAXECGGACCACACCACCACACCCACACCACACC AC AC ACC 
CACA-CACACACCACCACCCACACACCACACAG CCCGGn r^ 

Telomeric oligonucleotide 2 
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(80 nt, poly d(G N3 T), with an EcoRl restriction site overhang at its its 5'- 
end and a Smal site at its 3' -end both indicated by underlining, giving a 
total length of 80 nts). 

5 ' -AAlTGjCCCQQGCTGTGTGGTGTGTGGGTGGTGGTGTGTGTG 
TGGGTGTGTGGTGTGGTGTGGGTGTGGTGGTGTGGTCCG-3' 

Poly (A) 2 o tract 

(shifts PRS3 '/LacZ fusion into -1 reading frame with respect to start ATG) 
Oligonucleotide 1 

(poly d(A) 20 , with an EcoRI restriction site overhang at its 5* -end and a 
Smal site at its 3 '-end both indicated by underlining, giving a total length 
of 35 nts). 

5 ' -AAITCGG A A A AAAAAAAAAAAAAA AAAGCCCGGQC-3 ' 
Oligonucleotide 2 

(35 nt, poly d(T) 20 , with an EcoKL restriction site overhang at its its 5'-end 
and a Smal site at its 3' -end both indicated by underlining, giving a total 
length of 35 nts). 

5 ' - AA1TGCCQGGGCTTTTTTTTTTTTTTTTTTTTCCG-3 1 
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During the cloning of repetitive tracts into pSS3-9 a number of aberrant 
events occurred resulting in assay plasmids containing repetitive tracts as 
follows. Each of these tracts were subcloned into the centromeric and 
integrative vectors as previously described and incorporated into the assay 
system. 



fact Reading frame 

(AC) W . ! 

(AC) 41 (A) 2 (C) 2 (AC) 4 A - 1 
(AC) n + 1 

(AC), 2 A + i 

(AC) 15 A +1 

Construction of assay plasmids utilising the dominant selectable 
marker KanMX4. 

Loss of growth selection may be encountered when yeast strains 
containing the plasmid-borne URA3 marker are grown on a non-selective 
medium. This can occur when whole food extract containing growth- 
sustainable amounts of uracil are added to the medium. This problem can 
be overcome using a plasmid-borne KanMX4 dominant selectable marker 
consisting of the E. coli transposon T«903 fused to the transcriptional 
control-sequences-of-the-J£F-pne-^f-the fdamentous fungus Ashbya 
gossypii. This hybrid molecule permits the efficient selection of yeast 
transformants resistant to geneticin (G418) (Wach et al. 1994). 

Assay plasmids as described above containing the URA3 gene (cf Figs. 1A 
and 2A) were linearised by digestion at the unique Stul restriction enzyme 
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site (Promega), (+436 relative to the URA3 start codon). 40 ng of this 
linearised vector was ligated to 200 ng of a gel-purified 1481 bp 
EcoRV/Smal fragment from pFA6-KanMX4 (Wach et al. 1994) 
containing the KanMX4 cassette. An aliquot of this ligation mixture was 
5 then transformed into the E. coli strain RR1 and colonies growing on 
kanamycin were selected. Plasmid DNA was isolated from several kan- 
resistant colonies and subjected to restriction analysis. (The insertion of 
the KanMX4 cassette introduces a second EcdRl site into the assay 
plasmids as illustrated by comparing Figure 1A with IB and 2A with 2B.) 

10 

When assayed for repeat instability in S. cerevisiae these G418 selectable 
plasmids gave similar frequencies of frame-slippage to that measured for 
the original constructs relying on uracil selection, (cf. Table la). 

15 CONSTRUCTION OF MISMATCH REPAIR MUTANTS 

(a) MSH2 

The MSH2 gene of Saccharomyces cerevisiae (cf. Fig. 1) is one of several 
20 genes that share extensive homology with the bacterial MutS gene. 
Located on chromosome XV it encodes a protein of 109kda. Like the 
MutS protein, the MSH2p binds selectively to DNA containing mispairs 
and substrates containing up to 14 extra bases. Strains that contain 
mutated MSH2 genes have strongly elevated rates of spontaneous 
25 mutations and exhibit microsatellite instability (cf. Table 1). 

Disruption of MSH2 
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The MSH2 gene of Saccharomyces cerevisiae TN94-1 was disrupted with 
LEU2 (Fig. 13). Plasmid P RhB113 (Rhona Borts, Yeast Genetics, 
Institute of Molecular Medicine, John Radcliffe Hospital, Oxford 0X3 
9DU) containing the MSH2 gene disrupted with LEU2 at the SnaBI site 
was digested to completion with restriction enzyme Spel in buffer REactl 
(cf. Materials and Methods). The digestion mixture was transformed 
directly into S. cerevisiae YN904-1 (cf. Materials and Methods). 
Resulting transformants were then screened by Southern hybridization for 
the presence of the disrupted MSH2 gene (Figures 13a and b). 

Figure 13a. Southern blot analysis of YN97-10 (msh2::LEU2) 

Southern blot analysis of MSH2 genomic DNA (Fig. 13a) and 
corresponding disruption scheme (cf. Fig. 13b). Lane (1) HindUl digested 
control YN94-1 DNA. Two bands are seen (1650 and 2510 bp 
corresponding to the wild type MSH2 gene on chromosome 15). Lane (2) 
M/zdlll digestion of genomic DNA containing disrupted MSH2. The gel 
was probed with a 1662 bp EcoW fragment derived from P RhB113. A 
shift in the 2510 bp band is seen to 4710 bp corresponding to the insertion 
of LEU2. The disrupted msh2::LEU2 strain was then assayed for the 
mutator phenotype (cf. Table 1). 



(b) MLH1 



The MLH1 gene of Saccharomyces cerevisiae is one of several genes that 
shares extensive homology with the bacterial MutL gene. It is believed 
that MLHlp forms an interaction with MSH2p during the initiation of 
DNA mismatch repair in yeast. Yeast strains that contain mutated MLHl 
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genes have strongly elevated rates of spontaneous mutations and also 
exhibit microsatellite instability (cf. Table 1). The MLH1 gene of 
Saccharomyces cerevisiae YN94-1 was disrupted with LEU2. Plasmid 
pREdl82 (Rhona Borts, Yeast Genetics, Institute of Molecular Medine, 
5 John Radcliffe Hospital, Oxford 0X3 9DU) containing the MLH1 gene 
disrupted with LEU2 was digested to completion with the restriction 
enzymes SacVBamEl in buffer REact 3 (cf. Materials and Methods). This 
digestion mixture was transformed directly into S. cerevisiae YN94-1 (cf. 
Materials and Methods). Resulting transformants were then screened by 
10 Southern hybridization for the presence of the disrupted MLH1 gene on 
chromosome XIII (Fig. 14). 

Construction of mismatch repair pathway mutants for use in the 
assay system. 

15 

(c) MSH3 

The MSH3 gene of Saccharomyces cerevisiae is another gene that shares 
extensive homolgy with the bacterial MutL gene. MSH3p forms a 
20 heterodimer with MSH2p during the initiation of insertion/deletion 
mismatch repair. Yeast strains mutant for MSH3 exhibit a less profound 
increase in microsatellite instability as compared to MSH2 and MLH1 
mutant strains and have slightly elevated levels of spontaneous mutations 
(Strand etal. 1995). 

25 

Disruption of MSH3 



21 



WO 98/45476 

PCT/GB98/00869 

The MSH3 gene of Saccharomyces cerevisiae YN94-1 was disrupted with 
LEVI. P REa62 (Rhona Borts, Yeast genetics, Institute of Molecular 
Medicine, John Radcliffe Hospital, Oxford 0X3 9DU) containing the 
MSH3 gene disrupted with LEU2 was digested to completion with the 
restriction enzyme Aatll (cf. Materials and Methods). Resulting 
transformants were then screened by Southern hybridization for the 
presence of the disrupted MSH3 gene on chromosome III (data not 
shown). 

(d) MSH6 

The MSH6 gene of Saccharomyces cerevisiae is a further gene that shares 
extensive homolgy with the bacterial MutL gene. MSH6p forms a 
heterodimer with MSH2p during the initiation of spontaneous base-base 
mismatch repair (Alani et al. 1996). Yeast strains mutant for MSH6 
exhibit a less profound increase in microsatellite instability as compared to 
MSH3 mutant strains but have elevated levels of spontaneous mutations in 
comparison to MSH3 mutant strains. 

Disruption of MSH6 

The MSH6 gene of Saccharomyces cerevisiae YN94-1 was disrupted with 
KanMXl. This MSH6 disruption plasmid (Rhona Borts, Yeast genetics, 
Institute of Molecular Medicine, John Radcliffe Hospital, Oxford 0X3 
9DU) was created by cloning a 4 kb PCR fragment containing the MSH6 
gene into the Srfi site of pPCR script (Sratagene). A Pvull to EcoRV 
fragment containing the KanMX4 module (Wach et al. 1994) was then 
used to replace a SnaBl to Spel fragment of the MSH6 open reading frame 
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to create plasmid pSRC9. The msh6::KanMX4 disruption cassette was 
released by digestion with restriction enzymes SphI and BspEl and 
transformed into YN94-1 (cf. Materials and Methods). Resulting yeast 
transformants were then screened by Southern hybridization for the 
5 presence of the disrupted MSH6 gene on chromosome IV. 

Figure 15, Southern blot analysis of YN97-150 (msh6::KanMX4) 

Southern blot analysis of msh6 genomic DNA (Fig. 15A) and the 
10 corresponding disrupion scheme (cf. Fig. 15B). Lane (1) BamHl digested 
control YN94-1 DNA. No bands are seen when probed with the 1553 bp 
KanMX DNA. Lane (2) BamHl digestion of genomic DNA containing 
the disrupted msh6 gene. A single band of 2481 bp is seen corresponding 
to the insertion of KanMX4. 

15 

(d) RTH1 

The RTH1 (RAD27) gene of Saccharomyces cerevisiae is one of several 
genes encoding a 5^3' DNA exonuclease. Subsequent mutations arising 

20 in rthl strains are duplications resulting from a novel mutagenic process, 
and are not due to a defect in mismatch repair (Tishkoff et al 1997). 
During replication of the lagging strand, DNA polymerase extends into the 
downstream "Okazaki" fragment and displaces it, resulting in a 5' "flap" 
structure that is normally removed by RTHlp. In the absence of this 

25 exonuclease, extensive strand displacement synthesis occurs resulting in 
the duplication of DNA sequences. Recently, it has been shown that 
expansion of CAG repeat tracts are frequent in yeast strains defective in 
Okazaki fragment maturation (Schweitzer et al 1998). This result 
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supports the hypothesis that tract expansions that occur during passage of 
human disease allele results from excess DNA synthesis on the lagging 
strand of replication. 

Disruption of RTH1 



The rthl::KanMX4 disruption strain was constructed using standard 
techniques. RTH1 was amplifyed by PCR using primers . 3g0 5'- 
GTTAACAGTGACTTTCGGTGACAGATA-3 ' . 354 (numbers in subscript 
indicate positions relative to the start codon of RTH1) and 5'- 
GTTAACAAAGCTAGGTGTCGA AGG-3 ' 1424 (extra nucleotides were 
added to the 5'-end of this primer creating a Hpal site as underlined). 
These primers were made in-house at the 40 nmole scale. 

Conditions for amplification were as follows: 50 ng YN94-1 genomic 
DNA template, 0.8 pi 25mM dNTPs (dATP, dCTP, dGTP, dTTP, 
Pharmacia) mix, lpl each primer (200 pmole), 10 pi lOx Amplitaq® 
(Applied Biosystems) buffer, 2 units Amplitaq® enzyme made up to 100 pi 
with dist. H 2 0 and overlaid with paraffin oil. A Hybaid thermocycler 
was used. Programme: 92°C for 2 min for 1 cycle; then 92°C for 2 min, 
50°C for 3 min, 72°C for 3 min. for 30 cycles. The reaction was then 
precipitated using 10 pi sodium acetate pH5.2 and 200 pi ethanol for 30 
"min on ice. The~DNA was then centrimged at 15000g for 20 min. The" 
pellet was then washed in 75% ethanol and dried in vacuo before being 
resuspended in 30 pi H 2 0. 



The resulting PCR product was cloned into S/ral-digested pUC19 
(Yannisch-Perron et al. 1985) creating plasmid pKLRTHl. A 1481 bp 
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S/naI/£coRV-derived fragment containing the KanMX4 cassette was 
removed from pFA6-KanMX4 (Wach et al. 1994) and cloned into the 
EcoRV site of pKLRTHl (+66 relative to the RTH1 start codon) creating 
disruption plasmid pKLrthl::KanMX4. The rthl disruption cassette was 
5 released by Hpal digestion and then transformed into YN94-1. For 
selection of transformants geneticin (G418, Life Technologies) was added 
to the media at a final concentration of 200 pg/ml. Putative yeast 
disruptions were checked by Southern hybridization for the disrupted gene 
on chromosome XI and for the rthl temperature sensitive phenotype 
io (Sommers et al. 1995). 

Figure 16. Southern blot analysis of YN97-167 (rthl::KanMX4) 

Southern blot analysis of rthl genomic DNA (Fig. 16A) and the 
15 corresponding disruption scheme (cf. Fig. 16B). Lane (1) EcdBl digested 
control YN94-1 DNA. One band is seen when genomic DNA is 
hybidized with the 316 bp PCR probe. This 3794 bp EcoRI fragment 
corresponds to the wild type RTH1 gene on chromosome XI. Lane (2) 
£coRI digestion of genomic DNA containing disrupted rthl. A shift in the 
20 3794 bp band is seen to 2420 bp corresponding to the insertion of the 
KanMX4 cassette (this cassette contains an £coRI restriction site at its 3'- 
end). 

(e) POL30 

25 

The POL30 gene of Saccharomyces cerevisiae encodes PCNA 
(proliferating cell nuclear antigen), an essential component of the DNA 
replication machinery. PCNA exists in vivo as a homotrimer that acts as a 
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"sliding clamp" around the DNA helix thereby increasing the 
processitivity of DNA polymerases 5 and s. It has been shown that 
PCNA interacts strongly with the MSH2p/MSH3p heterodimer during 
mismatch repair (Johnson et a/.,1996) Yeast strains carrying the pol30- 
104 mutation show a dramatic increase in spontaneous mutation and tract 
instability, (cf. Table la). Plasmid pCH1577 (Amin & Holm, 1996) 
containing the pol30-104 allele marked with the LEU2 gene was digested 
to completion with Sad. The reaction products were transformed directly 
into S. cerevisiae YN94-1. Putative disruptions were patched and first 
checked for the cold sensitive phenotype (inability to grow at 15 C) then 
by Southern hybridization analysis and finally by sequencing which 
revealed the expected nucleotide exchange of a C to a T at position 752 
which results in the replacement of alanine by valine in the translated 
product. 



Figure 17. Southern blot analysis of YN98-3 (pol30-104::LEV2) 

Southern blot analysis of poBO-104 genomic DNA (Fig.l7A) and the 
corresponding disruption scheme (cf. Fig. 17B). Lane (1) EcoRV digested 
control YN94-1 genomic DNA. When hybridized with a LEU2 probe two 
bands are seen (2861 and 961 bp corresponding to the wild type LEU2 
gene on chromo some III. Lane (2) EcoRV digestion of genomic DNA 
-cmm^-poifO-W4 allele. The LEU2 probe hybridizes to four 
fragments, firstly the two wild type LEU2 fragments as above and 
secondly two fragments (appearing as a doublet of 2067 and 2076 bp) 
corresponding to the LEU2 marker inserted upstream of the POL30 locus 



(f) 



Amsh2 - deletion of MSH2 in an a - type yeast strain 



26 



WO 98/45476 



PCT/GB98/00869 



In general, it can be said that mutations affecting repeat stability are either 
dependent or independent of the MSH2p repair pathway. Mutations of 
MSH2 together with one or more genes within this epistatic group show a 
5 mutator phenotype no greater than the single MSH2 mutant alone. 
Mutations that are independent of this epistatic group together with the 
msh2 allele show an additive or even a multiplicative effect with regard to 
the mutator phenotype. 

10 We have deleted MSH2 in an a-type yeast strain, which, when crossed 
with any a-type mutant yeast strain will allow the combination of the 
mutant and msh2. In this way, it can be ascertained whether the primary 
mutation is dependent or independent of the MSH2p pathway. 

15 PCR-mediated disruption of MSH2 by SFH 

The SFH disruption cassette (Wach et al. 1994) was amplified from pFA6- 
KanMX4 by PCR using Amplitaq® (Applied Biosystems). Two primers 
were designed consisting of 40 bases of flanking sequence of MSH2 and 
20 20 bases complementary to the KanMX4 cassette. 

The primers 
SFH1: 

5 ' -TTATCTGCTGACCTAAC ATC AAAATCCTCAGATTA AAAGTAG 
25 CTGAAGCTTCGTACGCTG -3' 
SFH2: 

5'-TATCTATCGATTCTCACTTAAGATGTCGTTGTAATATTAAATA 
GGCCACTAGTGGATCTG -3' 
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Conditions for the amplification were as follows: 50 ng pFA6-KanMX4, 
0.8 m 25 mM dNTPs mix, 1 ul eac h primer (200 pmole), 10 p.1 lOx 
Amplitaq® (Applied biosystems) buffer, 2 units Amplitaq® made up to 100 
5 p.1 with dist. H*0 and overlaid with paraffin oil. For PCR programme 
see "disruption of RTH1 The PCR product was then precipitated using 
10 nl 3 M sodium acetate pH 5.2 and 200 pi ethanol for 30 min on ice. 
The DNA was the centrifuged at 15,000 g for 20 min. The pellet was 
then washed in 75% ethanol and vacuum dried before being resuspended 
io in50plofdist. H 2 0. Transformation of yeast was carried out according 
to the method of Gietz and Woods (1994). For selection, geneticin 
(G418)-supplemented medium was used. Putative yeast disruptions were 
checked by Direct Colony PCR (Pearson and McKee, 1991) using the 
following primers. 



15 



Checking primer 1 : 

-269 5'- AGCACTCCGTATAAACAAAG -3' - 250 
Checking primer 2: 

3iii 5'- TAGTGACAGTGGAATAAAGG -3' 3130 
20 (Numbers in superscript indicate positions relative to the MSH2 start 
codon). 

On checking by PCR a wild type MSH2 strain gives a product of 
approximately 3399 bp whereas strains in which MSH2 has been disrupted 
by the KanMX4 cassette yield a product of approximately 2016 bp. 

25 

Figure 14. Southern blot analysis of YN97-11 (mlhl::LEU2) 

Lane (1) EcoRV digested control YN94-1 DNA. Two bands are seen 

28 



WO 98/45476 PCT/GB98/00869 

(approximately 1000 and 3000 bp) corresponding to the wild type LEU2 
gene on chromosome III. Lane (2) EcoRV digests of yeast genomic DNA 
containing disrupted MLH1. The gel was probed with a 2200 bp LEU2 
fragment. Three bands are seen in the disruptant, two corresponding to 
5 the wild type LEU2 on chromosome III and the other larger band 
corresponding to the LEU2 gene used to disrupt MLH1 on chromosome 
XIII (approximately 5800 bp). 

METHODS 

10 

1 ft CPU TR ANSFORMATION USTNO THF "CALCIUM 
CHLORIDE " METHOD 

(Mandel&Higa 1970) 

15 Solutions: 

0.1 MMgCl 2 
0.1 M CaCl 2 

20 Protocol: 

An overnight culture was diluted (1:100) in 50ml of LB medium and 
grown at 37°C with shaking until OD 650 « 0.3-0.35 (2-2.5 hr approx.). 
The culture was divided into 2x25ml tubes and incubated on ice for 20 
25 minutes. Tubes were spun at 4°C for 3 minutes at 3500rpm in a 
centrifuge (Sigma 4kl0). The supernatant was discarded and 20ml 0.1 M 
MgCl 2 was added, vortexed and then immediately spun. 10ml of sterile 
ice-cold CaCl 2 was added and left on ice for 10 minutes. Tubes were spun 
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and pellet was resuspended in 2ml 0.1 M CaCl 2 . Cells were then frozen 
at -70-C or used directly for transformation. For transformation, lOOul 
of cell suspension was added to a pre-cooled reaction mbe together with 
4|il (l-10ng) of transforming DNA. This was then left on ice for 30 
minutes. Cells were then heat-shocked for 45 seconds at 42°C and then 
incubated on ice for 4 minutes. 1ml of LB medium was then added and 
the cells were incubated at 37X for 1 hour. This was then plated direcdy 
onto selective media LB plates and incubated overnight at 37°C. 

2 YEAST TR A NSFOFM \ TIQN — UgIM£__JHE_ "PLATjg" 
METHOD 

(Elble, 1992) 
Solutions: 



PLATE: 90ml 45 % PEG 4000 
10ml 1M Li-acetate 
lml 1 M Tris-HCI, pH 7.5 
0.2ml 0.5 M EDTA 

Protocol: 



The yeast strain was streaked out for single colonies onto YEPD agar and 
incubated overnight at 30°C. A single colony was picked and grown 
overnight in 10ml YEPD at 30°C with shaking. 0.5ml of the culture was 
spun down in a microcentrifuge (Sigma 112) and the supernatant was 
decanted by inversion. 10ul of carrier DNA (lOOpg) and lug 
transforming DNA was then added (no transforming DNA was added for 
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control) and vortexed. 0.5ml PLATE was then added, vortexed and 
incubated overnight at room temperature. Cells were then pelleted, 
washed and resuspended in 125p.l glass dist. H 2 0. This was then plated 
directly onto selective media and incubated at 30°C for 3 days. 

5 

3. RAPID ALKALINE EXTRACTION PROCEDURE EOtt 

SCREENING RECOMBINANT PLASMTD TWA 

(MINISCREENS1 ; 
(Birnboim & Doly. 1979) 

10 

Solutions: 

Resuspension buffer: 50 mM Tris-HCI 

10 mM EDTA, pH 8.0 
15 Lysis buffer: 200 mM NaOH 

1% SDS 

Neutralization Buffer: 3 M Na acetate, pH 5.5 
Isopropanol 

20 Phenol/Chloroform/isoamyl alcohol (25 :24 : 1 ) 
Chloroform/isoamyl alcohol (24:1) 
Na-acetate 3 M 
Ethanol 100% and 70% 

25 Protocol: 

1.5ml of an overnight culture was spun in a bench centrifuge (Sigma 112) 
for 3 minutes and the supernatant decanted. 200(il of resuspension buffer 

31 



WO 98/45476 PCT/GB98/00869 

was then added and vortexed. 200*11 of lysis buffer was then added and 
the tubes inverted six times 200pl of neutralization buffer was added 
immediately and the tubes inverted six times. This was then spun for 15 
minutes in a bench centrifuge and the supernatant transferred to a new 
5 tube. DNA was then precipitated with 0.6 vol. isopropanol, spun and the 
pellet washed with 70% ethanol. This was then dried in vacuo and 
resuspended in 50u.l glass dist. H 2 0. 

4 PREPARATIONS OF VFAST GRNOMTP my A 

10 (Sherman et al. 1982) 

Solutions: 

Sorbitol/Tris/EDTA: 1.2 M sorbitol 
15 10 mM EDTA 

0.1 MTris-HCI, pH 8.0 
Dithiothreitol (DTT) 1 M 
Zymolyase 20T, 5mg/ml (ICN Biomedicals Inc., Oxford, UK) 
RNase A, lOmg/ml (Sigma, Dorset, UK) - boiled for 10 minutes, then 
20 snap cooled. 

Pronase E, lOmg/ml (Sigma, Dorset, UK) 
SDS 10% 

Phenol/chloroform/isoamyl alcohol (25:24:1) 
Chloroform/isoamyl alcohol (24:1) 
25 Na-acetate 3 M 
Isopropanol 
Ethanol 75% 
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Protocol: 

An individual yeast colony was used to inoculate 10ml SCD-U and grown 
up overnight at 30°C with shaking. The culture was then spun at 

5 3000rpm for 3 minutes and resuspended in 5ml sorbitol/Tris/EDTA, 10|il 
DTT was added and the culture incubated at 30°C for 30 minutes. The 
cells were pelleted as before and resuspended in 0.5ml 
sorbitol/Tris/EDTA. 10\i Zymolyase 20T and I0\i\ RNase A were added 
and incubated at 30°C. I0\i\ pronase E and 50|il SDS were added and 

10 incubated at 37 °C for 2 hours. One phenol/chloroform/isoamyl alcohol 
and three chloroform/isoamyl alcohol extractions were then carried out. 
0.1 vol. 3 M Na-acetate and 0.6 vol. isopropanol were then used to 
precipitate the DNA which was pelleted and washed with 70% ethanol. 
This was then air dried and dissolved in lOOjil glass dist. H 2 0. 

15 

5. RESTRICTION DIGESTS 

(Maniatis et al 1989) 

Solutions: 

20 

Restriction Buffers (working concentrations) (Life Technologies Inc., 
Paisley, UK): 

REact™l 50 mM Tris-HCI (pH 8.0) 
25 10 mM MgCl 2 



REact 2 



50 mM Tris-HCI (pH 8.0) 
10 mM MgCl 2 
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50 raM NaCl 



REact 1 M 3 50 mM Tris-HCI (pH 8.0) 
10 mM MgCl 2 
5 100 mM NaCl 

REact™6 50 mM Tris-HCI (pH 7.4) 
6 mM MgCl 2 
50 mM KCI 
10 50 mM NaCl 

Protocol: 

For purification of DNA fragments, restriction digests were performed in 
15 a total volume of 30ul. 20^1 of DNA (0.2-lug) was digested with 2jil of 
enzyme, 3|al of lOx restriction buffer and 5ul glass dist. H 2 0. The 
reaction was incubated according to the manufacturer's instructions for a 
minimum of 3 hours. For restriction analysis, digests were performed in 
a total volume of lOpl. lpl of DNA was digested with lpl of enzyme, 
20 lfil of the appropriate buffer and 7ul glass dist. H 2 0. This was then 
incubated according to the manufacturer's instructions for 1 hour. 
Enzymes were then inactivated by heating at 60 °C for 10 minutes. 



6 MiCPVKRY QE DNA FRQM LOW-MEl .timh. 

25 TEMPERATURE AGAROSE 

(Weislander, 1979) 
Solutions: 
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Tris-borate (lxTBE) buffer: 0.089 M Tris-base 

0.089 M boric acid 
0.002 M EDTA pH 8 

5 

Ethidium bormide (working cone. 0.5 mg/ml) 
Tris-HCI 20 mM pH 8 
EDTA 1 mM pH 8 

Phenol/Chloroform/isoamyl alcohol (25 :24 : 1) 
10 Chloroform/isoamyl alcohol (24:1) 
Isopropanol 
Ethanol 70% 



Low-melting-temperature agarose (Life Technologies Inc., Paisley, UK). 

15 

Protocol: 

0.25g of low-melting agarose was dissolved in 25ml lxTBE and heated to 
70 °C. The gel was then cast and allowed to set at 4°C. DNA samples 

20 were then loaded into the gel and electrophoresis was carried out at 60v. 
The gel was then stained for 30 minutes in ethidium bromide and the band 
of interest was localised using a long-wave-length (300-360 mn) uV lamp 
to minimize damage to the DNA. The band was then cut out of the gel 
with a flamed scalpel and added to 5 vol. 20 mM Tris-HCI and 1 mM 

25 EDTA. This was then heated at 65 °C to melt the gel. DNA was then 
extracted using standard procedures and redissolved in 50jjl1 glass dist. 
H 2 0. 
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Solutions: 

5 

T4 DNA ligase (Promega-Biotec, Madison, USA) 

lx ligase buffer (Promega-Biotec, USA) : 50 mM Tris-HCI, pH 7. 8 

10 mM MgCl 2 

10 mM dithiothreitol 
10 1 mM ATP 

25 mg/ml bovine serum 

albumin 

Protocol: 

15 

Ligations were carried out in a total volume of lOp.1. Approximately 
O.lug of vector DNA and the appropriate amount of insert DNA was 
added to a sterile microfuge tube. Reaction was made up to 7.5 U 1 with 
glass dist. H 2 0. For cohesive end ligation, the reaction was wanned to 
20 65 °C for 5 minutes, 37°C for five minutes, room temperature for five 
minutes, then 4°C for five minutes. luJ ligase buffer and lp.1 T4 ligase 
was added and the reaction was then incubated overnight at 20°C for 
cohesive ends, or at 10°C for the ligation of blunt-ended DNA. l-4pl 0 f 
the ligation mixture could then be used for transformation off. coli. 

25 

8 SOUTHERN HYBRIDTZ A TTON 

(Southern, 1975, ECL™ Technical Manual, Amersham, UK, 1951) 
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This direct nucleic acid labelling and detection system is based on 
enhanced chemiluminescence. The system involves directly labelling 
probe DNA with the enzyme horseradish peroxidase, achieved by 
denaturing the probe so that it is in single stranded form. Subsequent 
5 addition of glutaraldehyde causes the formation of chemical cross-links so 
that the probe is covalently labelled with enzyme. Reduction of hydrogen 
peroxide by the enzyme is then coupled to a light producing reaction 
involving luminol, which on oxidation produces blue light which can be 
detected on light sensitive film. 

10 

Solutions: 



15 



Tris-borate TBE buffer: (cf. earlier) 
Agarose (Life technologies Inc., Paisley, UK) 
0.25 M HCI 
Denaturation solution: 



Neutralization solution: 



20 20xSSC 



Hybridization buffer 0.2 M: 



1.5MNaCL 
0.5 NaOH 
1.5MNaCl 

0.5 M Tris-HCI pH 7.5 
3 M NaCl 

0.3 M Tri-sodium citrate pH 7.0 
(Amersham Life Sciences, ECL Gold 
hybridization buffer, with 5% (w/v) 
blocking agent and 2.2 M NaCl) 



25 SDS 1% 



Hybond-N (Life Science, Amersham, UK) 
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ECL Nucleic Acid Labelling and Detection Kit (Life Science, Amersham, 
UK). 



Protocol: 

Pre-digested genomic DNA was run on a 1 % agarose gel in lxTBE. The 
gel was then immersed in 0.25 M HCI and agitated for 7 minutes. This 
was then rinsed twice with dist. H 2 0 and immersed in denaturation 
solution for 30 minutes. After rinsing with deionised H 2 0 the gel was 
immersed in neutralization solution for 30 minutes. The DNA was then 
transferred to Hybond-N with 20xSSC using a vacuum blotter (Appligen, 
Durham, UK) at 60 mbar for 1 hour. DNA was cross-linked onto the 
membrane in a Stratalinker 2400 (Stratagene, Cambridge, UK), using the 
autocrosslink mode (120,000 njoules/30 sec). The blot was then placed in 
15 a glass hybridization tube (80 x 200 mm, Techne, Cambridge, UK), and 
25ml hybridization buffer was added. The blot was then pre-hybridized 
for 15 minutes at 42°C. 



10 



20 



9 PREPARATI ON OF PRORF f 

The DNA fragment to be labelled was diluted to a concentration of 
lOng/ul in glass dist. H 2 Q. lOOng (lOpJ) was then denatured by boiling 



for 5 minutes. The DNA was immediately cooled on ice for 5 minutes 
and briefly spun in a microcentrifuge. 10[il of ECL DNA labelling 
25 reagent was added to the cooled DNA and mixed. 10p.l of ECL 
glutaraldehyde was added and mixed. This was then incubated at 37°C 
for 10 minutes. The probe was then added to the hybridization solution at 
a final concentration of lOng/ml. 
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The blot was hybridized overnight in a Techne-oven (Hybridizer HB-1D, 
Techne, Cambridge, UK) at 42°C and then washed at 65°C with 2x SSC 
and 1% SDS. A second wash was carried out with 0.5x SSC and 0.1% 
5 SDS. The blot was then exposed to luminescence detection film 
(Hyperfilm-ECL, Amersham, UK) for various lengths of time. 

10. DIRECT COLONY PCR « 
(Pearson & McKee, 1992) 

10 

Solutions: 

Glass dist. H 2 0 

Tag polymerase buffer lOx 
15 (Perkin Elmer, Cheshire, UK) 500 raM KCI 

100 mM Tris-HCI pH 9 
1% Triton X-100 

Taq polymerase (Perkin Elmer, Cheshire, UK) 

Primer: LacZ Reverse 1 :50 dilution 
20 Primer: YEp Forward 1:50 dilution 

dNTPs, 25 mM 

Sterile Oil 

PCR purification resin (Promega Biotech, Madison, USA) 
25 Protocol: 

Single yeast colonies were picked with a sterile tip and mixed with \Q\i\ 
glass dist. H 2 0 in a 0.5ml reaction tube and placed on ice. A mix 
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containing lOOul polymerase buffer, 780ul H 2 0, 8ul dNPTs, 4ul of each 
primer and 5pl Taq polymerase was then made up and vortexed briefly. 
90p.l of this mix was added to each reaction tube and sterile oil was added 
to prevent evaporation. 

PCR Programme (Thermal reactor, Hybaid, Middlesex, UK): 
92°C - 2 min 
1 cycle 

92°C - 2 min 
45°C - 3 min 
72°C - 2 min 
30 cycles 

The majority of the oil was pipetted off. The PCR product was purified 
using PCR purification resin according to the manufacturer's instructions 
and redissolved in lOOpJ glass dist. H 2 0. 

The reaction products were identified by running 15pl of the assay on a 
2% agarose gel in lxTBE. 

11. 3£EA£T_(3- G AL A CTOSTO A gjjj A.<yAv 

(Miller, 1972, Guarente, 1983) 

Solutions: 



Phosphate buffer (KPP) 0. 1M: 



1 M KH 2 P04 
1 M K 2 HP04 
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Mix both until pH 6.5, then dilute 1:10 



60 mM Na 2 HP0 4 
40 mM NaH 2 P0 4 .H 2 0 
10 mM KCI 
1 mM MgS0 4 

Na 2 C0 3 1 M 

P-mercaptoethanol (Millipore, Bedford, UK) 
10 0-Nitrophenyl P-D-Galacto-pyranoside (ONPG) (Sigma, Dorset, UK): 
4mg/ml 

0.10-0.11 mm acid-washed glass beads (B. Braun Melsungen, Germany) 
Protocol: 

15 

10 ml overnight yeast culture in SCD-U was spun for 3 minutes at 
SOOOrpm (Sigma 5kl0). The supernatant was decanted and the pellet 
washed twice with KPP buffer. The pellet was then frozen at -20°C for 3 
hours to "crack" the cells. The pellet was placed on ice and the same 

20 volume of glass beads was added, this was then vortexed at maximum 
speed for 1 minute. 1.2ml of Z buffer was then added and vortexed. This 
was then spun in a pre-cooled centrifuge (4°C) for 5 minutes at 3000rpm 
and the extract decanted into fresh chilled tubes. For the assay 25|il of 
extract was then added to 1ml Z buffer + p-mercaptoethanol (175|j.l P- 

25 mercaptoethanol in 50 ml Z buffer) and wanned to 30 °C (control; 1 ml Z 
buffer / p-mercaptoethanol + 25^1 Z buffer). 200^1 ONPG soln was 
added and the time measured for a yellow colour to appear. The reaction 
was then stopped with 500^1 M Na 2 C0 3 soln and the OD 420 measured 
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12 DETERMINATION OF VOT TIM E ACTTVTTV mfj un > m 
CELL EXTRACT^ 

5 

Volume activity was then calculated for each sample using the following 
equation: 

VOLUME ACTIVITY = 1000 x V T x OD 420 

10 4.5 x V E x dt 

Where V E = extract volume (25fil) 

V T = total reaction volume (1725(il) 
dt = time (min) 

15 

Unit Definition (U) = one unit (U) of p-galactosidase hydrolyses 1 

nMol ONPG per minute under the above 
conditions. 

20 13 PPTF.RMINATTO N OF TOTAL PROTFTN T N wjj 
EXTRACTS 



To determine specific p-glactosidase activity, total cell extract protein 
must be quantified. 

25 

Solutions: 



Bradford solution (Bio-Rad, Munich) 
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Protoc I: 

25|il cell extract was made up to 800(il with glass dist. H 2 0. 200p.l of 
5 Bradford solution was then added, vortexed and incubated at room 
temperature for 10 minutes. OD 595 was then measured (Ultraspec 2000, 
Pharmacia Biotech, St. Albans, UK) and protein concentration (mg/ml) 
was determined using a bovine serum albumin calibration curve. 

10 Determination of specific activity (U/mg) in cell extracts 

Specific activity of the cell extracts could then be calculated using the 
following equation: 



15 SPECIFIC ACTIVITY = 1000 x V T x OD 

4.5 x V E x dt x Cp 



25 



420 



Where V E = extract volume (25\x\) 

V T = total Reaction volume (1725(il) 
20 Cp = protein Concentration (mg/ml) 

dt = time (min) 

Unit Definition (U) = one unit (U) of P-galactosidase hydrolyses 1 

• nMol of ONPG per minute under the above 
conditions. 



14. GEL PURIFICATION OF OLIGONUCLEOTIDES 



Solutions: 
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4% polyacrylamide gel: 2.5ml glycerol 

2 . 5ml acrylamide/ N , N ' -Methy lenebisacrylamide 
40% (w/v) (Millipore, Bedford, UK) 
5 5ml 5xTBE 

15ml H 2 0 

200jxl 10% (w/v) ammonium persulphate (APS) 
12^1 N,N,N' ,N'-Tetra-methyl-ethylenediamine 
(TEMED), (Bio Rad, Herts UK) 

10 

Protocol: 

50|ig of each oligo was run on a 4% acrylamide gel in lxTBE at 100 
volts. Single stranded DNA bands were shadowed by UV using a 
15 fluorescent TLC plate and cut out. DNA was eluted from gel fragments 
into 200^1 glass dist. H 2 0 by shaking overnight and precipitated using 
standard procedures. 

15. DNA SEQUENCING 

20 (Sanger et aL, 1977) 

Protocol: 



Nucleotide sequences were determined by automated DNA sequencing 
25 based on the chain-termination method using the ABI 373A sequencer 
(Applied Biosystems, Foster City, California, USA). Double stranded 
DNA was sequenced using the 'Taq DNA polymerase dideoxy terminator 
cycle sequencing kit 1 (Applied Biosystems), with primers; lacZ Reverse 
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EXAMPLE I: 

Construction of preferred yeast assay p lasmids 
5 P Ka3-9(32-l), P KaCEN(32-l) & pKaINT(32-l) 

The preferred system utilises a unique group of yeast vectors containing 
the bacterial lacZ gene (minus the promoter and first 7 codons) fused to 
the first 29 codons and promoter region of the yeast gene PRS3 (5- 

io phospho-ribosyl-l(cc)-pyrophosphate synthetase) (Carter et al 1994). This 
functional gene fusion was preferably knocked out-of-frame (-1 reading- 
frame) by the insertion of a poly d(AC) 16 tract at the EcdKL site 
downstream of the PRS3 promoter and within the coding region initiated 
from the ATG of PRS3. The resulting PRS3'flacZ gene fusion containing 

15 this out-of-frame insertion was inserted into each of three yeast vectors: a 
high copy vector, a low copy vector and a single copy integrated into the 
yeast genome. 

l(i) pKa3-9(32-l) (high copy construct) 

20 

The initial PRS3'/lacZ fusion was constructed by the insertion into the 
multiple cloning region of YEp356R (Fig. 2, Materials and Methods) 
(Myers, A.M., et al. 1986) of a 371 bp DNA fragment comprising the 
promoter region and regulatory elements of the yeast gene PRS3. This 
25 371 bp Hpal/Clal fragment (-284 - +85 relative to the PRS3 start codon) 
was rendered blunt and the resulting fragment ligated into the unique Smal 
site of YEp356R, creating plasmid pSS3-9. Oligonucleotides 1 and 2 (cf . 
Materials and Methods) containing 16 repetitive AC or TG units were 
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annealed and cloned into the EcoRl site situated within the multiple 
cloning region of pSS3-9 (50fig of each oligonucleotide was purified on a 
4% acrylamide gel and redissolved in glass dist. H 2 0. 200ng of each 
olinucleotide pair were then annealed in glass dist. H 2 0 by heating to 

5 90 °C and allowing to cool slowly to room temp. pSS3-9 was cut to 
completion with EcoM in buffer REact™3 and subsequent ligation was 
carried out with 100-fold excess of insert to vector at 12 °C (cf. Materials 
and Methods)). Ligations were transformed directly into S. cerevisiae 
YN94-1 using the "plate" method (cf. Materials and Methods) and 

10 resulting colonies were screened for the presence of insert by "direct 
colony PCR" (cf. Materials and Methods). Insertion of this repetitive tract 
creating assay plasmid pKa3-9(32-l) resulted in shifting the coding region 
of the PRS3'/lacZ fusion to the -1 reading frame. Also contained within 
plasmid pKa3-9(32-l) are sequences from the yeast 2 micron origin of 

15 replication which allows this construct to exist episomally at a copy 
number of 50-150 per cell. pKa3-9(32-l) also contains the ColEl origin 
permitting replication in Escherichia coli. Selection for the assay plasmid 
is allowed by the incorporation of the ampicillin resistance gene of E. coli 
and the URA3 gene of yeast respectively (see. Fig. 5). 

20 

l(ii) pKaCEN(32-l) (low copy construct) 

Construction of this low copy centromere-based plasmid involved removal 
of the PRS3'/lacZ fusion containing the poly d(AC/TG) 16 tract from pKa3- 
25 9(32-1) on a 3680 bp BamHlfNsil fragment (cf. Fig. 5). Initially pKa3- 
9(32-1) was digested to completion with BamHl and Nsil in restriction 
buffer REact™3 (Materials and Methods). KCI was added to adjust the 
salt concentration to that of REact™6 and digestion with Seal was then 
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carried out to eliminate incorporation of the smaller BamEI/Nsil fragment. 
The resulting 3680 bp fragment containing the PRS3-'lacZ fusion was gel 
purified (cf. Materials and Methods). pRS416 was also digested to 
completion with NsiL and BamHl in buffer REact™3 with the aim of 

5 removing the existing lacZ region on a 923 bp fragment. The resulting 
3975 bp vector band containing the ARS and centromere sequences was 
gel purified and ligated to the 3680 bp fragment of pKa3-9(32-l) 
containing the PRS3/LacZ fusion. In this way the 7655 bp plasmid - 
pKaCEN(32-l) (Fig. 6) was generated. The ligation mixture was then 

io transformed into E. coli DH5cc (cf. Materials and Methods) and placed 
onto LB + ampicillin plates. DNA from resulting colonies was purified 
using the rapid alkaline extraction procedure (cf. Materials and Methods) 
and screened by restriction analysis. The low copy centromere-based 
plasmid exists episomally at levels of 1-5 copies per cell (see Fig. 6). 

15 

l(iii) pKaINT(32-l) (single copy integrative construct) 

Construction of pKaINT(32-l) involved removal of the PRS3'/lacZ fusion 
containing the poly d(AC) 16 tract from pKa3-9(32-l) on a BamUI/Nsil 

20 fragment. This fragment was subsequently cloned into BamWIINsil 
digested integrative plasmid YIp352 (cf. Fig. 4, Materials and Methods 
(Myers et al. 1986)). Before pKaINT(32-l) could be integrated into the 
yeast genome it was linearised by restriction at its unique Ncol site located 
within the URA3 gene. Integration then occurred by homologous 

25 recombination between the URA3 gene of pKaINT(32-l) and the genomic 
URA3 locus of S. cerevisiae YN94-1 (Fig. 8). The resulting URA3 strain 
was checked by Southern hybridization to confirm the insertion of the 
integrative plasmid (cf. Materials and Methods). (See Figs. 7, 8, 9 and 



47 



WO 98/45476 PCT/GB98/00869 
10). 

2. Southern blot analysis of integrated vector 

5 Southern hybridization was used to confirm the integration of pKaINT(32- 
1) into the yeast genome (cf. Materials and Methods). A 423 bp EcoRI 
derived restriction fragment from pKaINT(32-l) containing the yeast 
PRS3 promoter region was used to probe YN94-1 genomic digests (Fig. 
7). In 5 clones transformed with pKaINT(32-l), probing BamHl/Nsil- 

io restricted yeast genomic DNA revealed two bands; firstly, a 2719 bp 
fragment arising from the wild type PRS3 promoter region on 
chromosome VIII, and secondly a 3680 bp fragment corresponding to the 
engineering PRS3 promoter of pKaINT(32-l), integrated at the URA3 
locus on chromosome V. Therefore the plasmid has been stably integrated 

15 into the genome (Figs. 8, 9 and 10). 

From the Southern blot (Fig. 10) we see that lane 6 gives the same 
banding pattern as the control YN94-1 in lane 1. This pattern may be due 
to recombination between the repeated URA3 genes on chromosome V. In 
20 this way the integrated vector is lost and the URA3 mutation reverts to 
wild type, hence growth on selective media and loss of the 3680 bp band 
on theblot. 



EXAMPLE 2: 

25 Sensitive screening system in S. cerevisiae utilising the liiHfprasP 
reporter gene 

Here we describe the use of an alternative reporter gene in a screening in 
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S. cerevisiae, utilising the light emitting properties of the luciferase gene 
product. The development of this system involves simple sub-cloning 
steps with a fragment containing the luciferase gene derived from "pGL3- 
Promter vector" (Promega-Biotec) (cf. Fig. 15). This fragment is fused to 
our PRS37(AC) 16 tract using the three assay plasmids described above 
which have had the lacZ reporter gene previously removed. DNA 
stability is then monitored by alterations in levels of measured light 
emittance using a Lumat luminometer LB9501 (Berthold, UK) with the 
luciferin substrate obtained from Promega Biotec. 

2(i) Further example of a suitable reporter gene: 

UBA1 — (orotidine-5 '-phosphate decarhnxvia^ ?m e. ^ 

Saccharomvces cerevisiae. 

Isolated and sequenced by Mark Rose, Paula Grisafi and David Botstein. 
(1984); Structure and function of the yeast URA3 gene: expression in 
Escherichia coli. Gene, 29, 113-124. 

SGD Name: URA3/YEL021W (http://genome-www.stanford.edu/) 

Cells in which the URA3 gene is expressed can be selected against by 
growth on plates containing 5-fluoro-orotate (5-FOA). 

EXAMPLE 3: 

Preferred human assay vectors 



Screening system utilising the B-galatosidase reports pe ne for up * in 
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human cell linpg 

This screening system involves the construction of a similar plasmid to 
that used in the yeast system. In this case a fragment containing the 
5 luciferase gene is removed from pGL3-Promter vector (Fig. 15) and 
replaced with a fragment derived from one of the three assay plasmids 
described in Example 1. This fragment derived from the assay plasmid 
contains the poly(AC)j 6 tract fused to the P-galactosidase reporter gene 
minus the yeast promoter and start codon. Cloning of this fragment into 

io pGL3-Promter vector results in an SV40/tacZ fusion containing a 
poly(AC) 16 tract in the open reading frame. Replication in human cell 
lines is allowed through the incorporation of the EBV (Epstein-Barr virus) 
origin of replication derived from plasmid pDR2 (cf. Fig. 16). Selection 
in human cultured cells is by incorporation of a fragment containing the 

15 hygromycin resistance gene also derived from plasmid pDR2 (cf. Fig. 16). 

Rather than yeast cells as described in the previous Examples, in another 
embodiment human cell lines are utilized. 

20 Preferably the cells are derived from tumours of colorectal cancer patients 
or other tumours which exhibit instability in repetitive (microsatellite) 
DNA sequences. Alternatively, human cell lines can be engineered to 
contain mutations in genes implicated in mismatch repair pathways, such 
as hMSH2 and/or hMLHl, using standard mutagenesis techniques within 

25 the knowledge of a skilled person. 

The following vectors are used to transform human cell lines in the same 
way that the vectors of Examples 1 and 2 are used to transform yeast cells 
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according to preferred embodiments of the assay according to the first 
aspect of the invention. 

ex ample; 4: 

5 Assay for testing the carcinogenic properties of a test substance 

Using the test system of the invention the background frequency of 
microsatellite instability associated with wild type (mismatch repair- 
competent) and strains defective in DNA mismatch repair has been 
10 assayed. As shown in Table 1 the yeast strains defective in mismatch 
repair show extensive DNA instability. This situation can be compared to 
the DNA instability observed in cell lines derived from colorectal 
carcinomas. 

15 Table 1: Frequency of alternation in lengths of poly(AC) tracts in 
wild type yeast strains and DNA mismatch repair mutants. 



Strain 


Relevant 


Tract location 


Frequency of 


Frequency 




genotype 




tract 


relative to 








alteration 


wild type 


YN96-57 


wild type 


pKa3-9(32-l) 


1.35X10" 4 


1 


YN97-25 


msh.2 


pKa3-9(32-l) 


6.80xl0" 2 


504 


YN97-28 


mlhl 


pKa3-9(32-l) 


1.14X10' 1 


843 


YN97-31 


msh3 


pKa3-9(32-l 


6.06xl0~ 3 


45 


YN97-22 


wild type 


pKaCEN(32-l) 


l.OOxlO' 5 


1 
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YN97-26 


msh2 


,pKaCEN(32-l) 


5.90xlO" J 


590 




YN97-29 


mlhl 


pKaCEN(32-l) 


6.14xl0' 3 


614 




YN97-32 


msh3 


pKaCEN(32-l) 


1.40xl0" 3 


139 




YN97-23 


wild type 


URA3 locus, 


<5.0xl0" 6 


1 





YN97-27 


msh.2 


chr. V 


1.59xl0 3 


318 




YN97-30 


mlhl 


URA3 locus, 


1.82xl0" 3 


363 




YN97-33 


msh3 '<• 


chr. V 


8.45xl0" 5 


9 








URA3 locus, 












chr. V 









Basal frequency of microsatellite instability of the various genetic 
backgrounds within the yeast assay system 



Using each type of assay construct frequencies of microsatellite instability 
were measured in relation to wild type yeast and disruptions as described 
in Table la. Slippage events responsible for the white to blue colony 
transition were determined by sequencing. 

(I) Wild type 



As shown in Table la the wild-t ype yeast strain for mismatch r^ir . 
YN94-1 - shows a low background white to blue frame-shifting frequency 
when transformed with constructs described above in the context of high 
copy 2 micron-based yeast origin of replication. Frequency of instability 
increases relative to the length of the repetitive tract (obviously a longer 
tract is more likely to undergo mutational change than a shorter one). 
Frequency of instabilities are approximately 10-fold lower in constructs in 
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the context of the centromere-based vectors compared to the high copy 
constructs and approximately 100-fold lower in the single copy integrative 
assay constructs. For the high copy construct containing the (C 1 . 3 A) n 
telomeric tract the frequency of alteration was approximately 3xl0" 5 

5 suggesting that telomeric repeats are much more stable than poly (AC) 
tracts. In this way, the overall sensitivity of the assay system can be 
altered through a choice of plasmid copy number and repetitive tract 
length. Little difference in slippage-frequency was seen between the 
constructs in the -1 reading-frame compared to those in the + 1 reading 

10 frame when measured in the wild type strain, indicating no bias towards 
insertions or deletions. The sequence of the d(AC) tract in plasmids 
rescued from blue colonies was determined as described. As an example, 
for assay constructs containing the poly d(AC) 16 tract (-1 reading-frame) 
the only alteration detected was the loss of one (AC) pair which resulted in 

15 a poly d(AC)i5 tract. This seemed to be irrespective of plasmid copy 
number or tract location. Using this strain (cf. Table la) 14 of the 20 
high copy plasmids in the -1 reading- frame analysed had altered tract 
lengths. The 6 blue colonies with unaltered tracts presumably had a 
mutation elsewhere in the LacZ gene. Alternatively as this is a high copy 

20 construct the subpopulation of altered tracts responsible for the white to 
blue transition may not have been rescued. This latter theory was 
strengthened by the finding that 10 out of the 10 low copy centromere- 
based (-1) plasmids rescued (cf. Table la) contained altered tract lengths. 
For blue yeast colonies containing chromosomally integrated poly(AC) n 

25 tracts a PCR was carried out as described. Four PCR products 
corresponding to (AC) n tracts integrated at the URA3 locus on 
chromosome V were analysed. Sequencing showed that each repetitive 
tract had lost a single (AC) pair. The loss of this (AC) pair resulted in a 
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shift of the coding region from the -1 reading frame back into the correct 
reading frame thereby giving rise to a fimctional gene product. For assay 
constructs containing the poly d(AC) 12 A tract (+1 reading-frame) the most 
frequent alteration detected was the gain of one (AC) pair which resulted 
in a poly d(AC) I3 A tract. Again, this was irrespective of plasmid copy 
number or tract location. In the wild type strain YN94-1 (cf. Table 1) 9 
of the 12 high copy plasmids analysed contained altered tract lengths. 8 of 
these contained one extra (AC) pair, the 9th contained an extra 4 (AC) 
pairs giving a poly d(AC)i 6 A tract. The gain of a single (AC) pair or in 
one case 4 (AC) pairs resulted in the shifting of the coding region of the 
PRS3'/lacZ fusion from the +1 reading frame back into the correct 
reading frame giving rise to a functional gene product. 

(II) mshl 



A 100-600 fold increase in the frequency of microsatellite instability was 
measured in yeast strain YN97-10 carrying a disrupted msh.2 allele (Table 
la). Difficulty was encountered in determining frequencies of slippage in 
this strain carrying high copy plasmids due to the number of mutational 
events observed. The number of slippage events indicated by the white to 
blue colour transition was found to be higher in studies utilising the assay 
constructs in the -1 reading-frame compared to studies utilising assay 
constructs in" the +1 reading-frame, indicating a bias for deletions over 
insertions. This finding is consistent with the previous results of Strand et 
al. (1993) who also found a bias towards deletions for this strain. The 
values clearly illustrate MSH2p as being a key component in the mismatch 
repair pathway with a strategic and unique role in the identification and 
correction of insertion/deletion loop structures in the genetic material. 
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ail) mlhl 

Mutation in the yeast mismatch repair gene MLH1 (Table la) also has a 
5 profound effect on the rate of microsatellite instability within all the assay 
constructs. The degree of instability is in the same order of magnitude to 
that of mshl mutants (approx. 200-800 fold, cf. Table 2). Again 
frequencies of frame-slippage are higher in studies utilising the assay 
constructs in the -1 reading-frame. Although the exact function of MLHlp 
io is unknown these results again illustrate the importance of this gene 
product in the correction of mispaired loop structures. 

(IV) msh3 and msh6 

15 Mutation in the yeast genes MSH3 and MSH6 have less profound effects 
on the rate of microsatellite instability within this yeast system as 
compared to mutations in MSH2 and MLH1 (Table la). Mutation in the 
MSH3 gene causes an approx. 30-fold increase in frame-slippage 
compared to wild type. An approximate 2-fold increase in white to blue 

20 transition was seen in studies using the high copy (-1) construct compared 
to the (+1) construct and a 4-fold increase was seen in the levels of 
slippage using the centromere-based (-1) construct compared to the 
centromere-based (+1) version, again indicating a bias towards deletions 
rather than insertions in this strain. This finding is also consistent with the 

25 previous findings of Strand et aL (1995) who found a bias towards 
deletions for the strain deleted in MSH3. Mutation in the MSH6 gene 
causes an 8-12 fold increase in insertion/deletion events indicating the 
secondary role this mutS homologue plays in the pathway. 
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(V) rthl 



10 



15 



20 



We have shown that strains mutated for the gene encoding this 5'->3> 
exonuclease RTHlp, cause a 20-260 fold increase in the frequency of 
frame-slippage compared to wild type. We have found that this is also 
dependent on the reading-frame of the assay construct used. Previous data 
(Johnson et al. 1995) have shown that mutations in RTH1 cause up to a 
280-fold increase in instability of simple repeats compared to wild-type 
strains. Secondly, tract changes involving single repeats are exclusively 
insertions in the rthl strain consistent with the theory of displacement 
synthesis. Similar to these results we have found a 260-fold increase in 
instability for centromere-based assay constructs in the +1 reading-frame 
and a 20-fold increase for assay constructs in the -1 reading-frame (Table 
1). On sequencing of these tracts we found that in assay constructs 
containing tracts in the +1 reading-frame all alterations were due to the 
insertion of one single repeat unit whereas all the alterations in tracts 
originating in the -I reading-frame showed the loss of one (AC) pair or in 
one case the loss of four (AC) pairs. No insertions were found. These 
results suggest that as well as the prevention of displacement synthesis 
leading to the duplication of DNA sequences, RTHlp may also play a role 
in the prevention of loops occurring in the template strand. 



{Wl)pol30 

25 

The pol30-104 mutation in PCNA causes an approximately 20-100-fold 
increase in the frequency of tract alteration as compared to wild type. The 
effect oipol30-104 on tract alterations, however, is not quite as severe as 
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that of the null mutations in the mismatch repair genes (cf Table la). 
Epistatic analyses of pol30-104 with null mutations in mismatch repair 
genes MSH2, MLH1 and PMS1 have shown that rates of tract instability 
were the same in double mutants of pol30-104 with null mutations in 
5 mismatch repair genes and in single mismatch repair mutants (Johnson et 
aL 1996). Therefore, it can be said that hypermutability in this mutant 
results from a defect in mismatch repair. 

4(i) Determination and verification of frame-slippage events 

10 

When transformed into yeast the preferred three assay plasmids produce 
white colonies on medium containing a chromogenic substrate for p- 
galactosidase and no p-galactosidase activity can be measured (cf. 
Materials and Methods). Frame-slippage within the repetitive poly 

15 d(AC/TG) 16 region due to faulty DNA mismatch repair results in in-frame 
variants which are detected as blue colonies (Fig. 12). The slippage 
events can be verified after plasmids rescued from 5. cerevisiae (cf. 
preparation of yeast genomic DNA, Materials and Methods) are 
transformed into commercially available strains of E. coli. (Materials and 

20 Methods). The actual slippage event was identified by DNA sequencing 
using primer "PRS3 Forward" (cf. Materials and Methods). The 
procedure used was based on the chain-termination method (Sanger et aL 
1977) using the ABI 373A sequencer (Applied Biosystems, Foster City, 
California, USA) (cf. Materials and Methods). 

25 

Sequencing of the repetitive region within the three assay plasmids 
revealed that the blue yeast colony on a plate corresponded to a slippage 
event at the DNA level. The most frequent slippage event was the loss of 
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one (AC) pair, the result of which shifted the coding region of the 
PRS3'lacZ fusion back into the correct reading frame giving rise to a 
functional gene product (Fig. 12). Once sequenced, resulting in-frame 
variants of the high copy, low copy integrative assay plasmids were 
5 measured for specific P-galactosidase activity in yeast (Fig. 12). 

4(ii) Influence of certain food components on DNA instability and 
repair 

10 4(iii) Screening for the potential negative influences of compounds on 
DNA stability and mismatch repair 

As well as being able to screen for the protective (anticarcinogenic) effects 
of human dietary components on the stability of microsatellite DNA, it is 

15 also possible to screen for the negative (carcinogenic) influences of 
compounds using the preferred simple colour selection system in yeast 
according to the first aspect of the invention. When screening for 
carcinogens, a wild type yeast strain for mismatch repair is transformed 
with the high copy assay construct for increased sensitivity. This 

20 transformed strain is then grown in the presence or absence of carcinogens 
and the frequency of white to blue colonies arising from the different 
growth conditions are subsequently compared. Carcinogens increase the 
rafeof ^Mt^tcTblu^transition as compared~to~"normar growth conditions 
(cf. Fig. 1). 

25 

The intercalating agent ethidium bromide and its effect on 
microsatellite instability 
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Ethidium bromide - a known potent carcinogen causes frame-slippage 
through its action of intercalation. This and other intercalating agents 
preferentially target monotonic runs or alternating nucleotide sequences. 
As shown in Figure 17, at higher concentrations (6-7iig/mI) this agent 
5 increases the frequency of frame-slippage 300 fold compared to an 
untreated strain (cf. Figure 1). 

Method 

io Yeast strain YN94-1 was transformed with pKa3-9(32 1) and plated onto 
selective medium (cf. Materials and Methods). A single colony was 
inoculated into 100ml SCD-Uracil (cf. Materials and Methods) and grown 
to mid-log phase. Culture was divided into 10 x 10ml sterile bottles. 
Cells were harvested, washed in 0.1 M potassium phosphate (KPP) buffer 

15 (pH 6.5) then resuspended in KPP buffer at a concentration of 
approximately 10 6 cells/ml. Ethidium bromide was then added to 
concentrations between 0 and 9p,g/ml, and the cells were incubated at 
30 °C with agitation for approximately 7 hours. Following incubation the 
cells were washed with 0.1 M KPP buffer, diluted in 10ml H 2 0 and plated 

20 on SCD Uracil for single washed with 0.1 M KPP buffer, diluted in 10ml 
H 2 0 and plated on SCD Uracil for single colonies (4 plates for each 
ethidium bromide concentration). This experiment was repeated three 
times. 

25 Polyamines and DNA stability 

Polyamines - a group of flexible polycations are normal constituents of the 
cell and are essential for many cellular processes. They are found in high 
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concentrations in red meat, fish and vegetables. Under physiological 
conditions putrescine, spermidine and spermine are protonated and possess 
two, three and four positive charges respectively. Spermine, with its four 
positive charges binds two phosphate groups in each strand of the DNA 
5 helix. This spanning of the major and minor groove by spermine 
stabilises the DNA helix (Heby, O. & Persson, L., 1990). 

The frequency of frame-slippage for the preferred centromere-based assay 
plasmid pKaCEN(32-l) of Example l(i) has been determined in yeast 
10 strains disrupted in steps of polyamine synthesis (Fig. 11 and Table 2). 
Results so far indicate that polyamines are required for the stabilisation of 
DNA during replication and could therefore aid in the prevention of 
frame-slippage. 



15 Table 2: Frequency of alteration in lengths of poly(AC) tracts in wild 
type yeast strains and polyamine mutants. 





Strain 


Relevant 
genotype 


Tract location 


Frequency of 
tract 
alteration 


Frequency 
relative to 
wild type 
















2602 
Y359 
Y362 


wild type 
spel 

spel spel 


pKaCEN(32- 
1) 

pKaCEN(32- 
1) 

pKaCEN(32- 
1) 


6.20xl0~ 6 
1.02X10" 4 
1. 54x10-" 


1 

16.35 
24.79 
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A simple colour assay for monitoring spontaneous base-base 
mismatches in S. cerevisiae. 

5 As described, the mismatch repair pathway of eukaryotes repairs both 
insertion/deletion mismatches and spontaneous base-base mismatches. 
Therefore, using the above yeast strains we can use our assay to monitor 
the genetic and dietary influences on spontaneous base/base mismatches in 
yeast. 

10 

All of the above mentioned yeast strains are derived from the wild type 
yeast YN94-1 available from Dr Michael Stark, Department of 
Biochemistry, University of Dundee, DDI 4HN, Dundee, Scotland, UK 
and Professor Michael Schweizer, Genetics & Microbiology Department, 

15 Institute of Food Research, Norwich Laboratory, Norwich Research Park, 
Norwich, NR4 7UA, England, UK and as such contain the ade2A point 
mutation which affects the biosynthesis of purine nucleotides (Figure. 18). 
The red pigment that accumulates in adel and adel mutants derives from 
an intermediate formed in reaction 5. This step involves the closure of an 

20 imidazole ring by phosphoribosylaminoimidazole synthetase to yield 
phosphoribosylaminoimidazole (AIR). Obviously, wild type ADE2 cells do 
not accumulate this pigment and therefore remain white. Therefore, 
subsequent reversion of red ade - strains to wild type enables us to 
monitor the frequency of spontaneous mutations by a red to white colour 

25 transition. We have subsequently measured the basal frequency of red to 
white reversion in the various genetic backgrounds of this system (Table 
3) 
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Frequency of ADE + reversion in various yeast genetic 
backgrounds 



Frequency of ADE + Rate relative to 

reversion wild type 



wild type 


2.51 x 10" 4 


1 


mshl 


1.60 x 10* 2 


66 


msh3 


4.80 x 10" 4 


2 


msh6 


3.57 x lO -3 


14 


mlhl 


1.87 x 10" 2 


74.5 


rthl 


3.42 x 10' 3 


13.6 


potfO 


6.30 x 10' 3 


25 



15 4(iii) Industrial Applicability 



The identification of human dietary components that protect against DNA 
instability and therefore some types of cancer by use of the present 
invention will contribute to the scientific basis for a healthy diet. The 
20 simple blue/white colour test according to a preferred embodiment can be 
provided in kit-form or scaled up for use in the food or pharmaceutical 
industries. 



Once a test substance is identified as containing "protective" factors, 
25 programmes can be undertaken to characterize and elucidate the mode of 
actin of the protective factor within the foodstuff. The results from the 
assay of the invention should be of enormous value to plant and crop 
breeders who wish to produce foodstuffs of greater nutritional value. 
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It has been observed that drug resistant human ovarian carcinoma cell 
lines aquire a mutator phenotype and a deficiency in hMLHl repair 
activity, with loss of expression of the hMLHl subunit occurring in 9/10 
independently derived cisplatin resistant sublines (Hirst et aL 1997). This 
observation shows that loss of liMLHl expression is a frequent event in the 
development of drug resistance and support the involvement of mismatch 
repair in mediating the cytotoxic action of chemotherapeutically important 
drugs. The assays of the invention include yeast strains carrying this mlhl 
mutation and so could be used in an in vivo study looking at the effects of 
cytotoxic agents and subsequent resistance. 
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CLAIMS 

1. An assay for testing the carcinogenic properties of a test substance 
comprising: 

(i) introducing into cells a reporter gene expression vector 
comprising a repetitive DNA sequence which exhibits instability in cancer 
cells, whereby instability of the repetitive DNA sequence affects 
expression of the reporter gene; 

(ii) exposing the resulting cells to the test substance; and 

(iii) determining whether the test substance is carcinogenic or anti- 
carcinogenic by comparing the frequency of reporter gene expression in 
the resulting cells with the frequency of reporter gene expression in cells 
which have not been exposed to the test substance. 

2. An assay as claimed in Claim 1 wherein the cells are eukaryotic 
cells. 



3. An assay as claimed in Claim 1 or 2 wherein the cells are yeast or 
human cells. 



4. An assay as claimed in Claim 1, 2 or 3 wherein the cells are 
defective in one or more repair processes, which defects lead to repetitive 
DNA~seqilence instability^ 



5. An assay as claimed in Claim 3 wherein the cells are human cells 
containing one or more mutations in the hMSH2 gene on chromosome 2 
and/or the hMLHl gene on chromosome 3. 
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6. An assay as claimed in Claim 3 wherein the cells are yeast cells 
which contain one or more mutations in the MLH1 gene and/or MSH2 
gene and/or PMS1 gene. 

5 7. An assay as claimed in any one of Claims 1 to 6 wherein the 
unstable repetitive DNA sequence comprises poly d(AC) and/or poly 
d(GT). 



8. As assay as claimed in Claim 7 wherein the unstable repetitive DNA 
10 sequence comprises poly d(AC) n and/or poly d(GT) n , wherein n is from 8 

to 60, preferably from 16 to 32. 

9. An assay as claimed in any one of Claims 1 to 8 wherein the 
repetitive DNA sequence exhibits instability in human colorectal cancer 

15 cells. 



10. An assay as claimed in any one of Claims 1 to 9 wherein the 
reporter gene is selected from lacZ, URA3 or luciferase. 

20 11. An assay as claimed in Claim 10 wherein the reporter gene 
comprises the lacZ gene. 

12. An assay comprising: 

(i) testing the carcinogenic properties of a test substance in yeast 
25 cells in an assay as claimed in Claim 3 or Claim 4 or any of their 

dependent claims; and 

(ii) further testing the carcinogenic properties of the test substance 
in an assay in human cells, as claimed in Claim 3 and any of its dependent 
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claims. 



10 



13. An assay for testing the carcinogenic properties of a test substance 
comprising the steps of: 

(i) screening a test substance using a reporter gene expression 
vector as claimed in any one of Claims 1 to 1 1; and 

(ii) repeating the assay using cells containing a high, medium, low 
or single copy number reporter gene expression vector, the high, medium, 
low or single copy number vector being selected depending on the 
frequency of repetitive DNA instability measured in step (i). 
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Yeast Promoter 



Assay plasmid for monitoring frame-slippage caused by faulty 
DNA mismatch repair in Saccharomyces cerevisiae 



Fig.l 



CP 
Cb 



Transformation into Saccharomyces cerevisiae 



Yeast strains defective in the mismatch 
repair pathway (e.g. msh2, msh3, msh6, 
mlh1, pmsl) or other diet-related metabolic 
pathways 



Yeast strains are grown on a defined 
medium in the presence or absence of 
protective dietary components e.g. 
Broccoli extract 
Polyamines: present in meat and vegetables 
- known to play in role in DNA stability 




Plates are incubated at 30 C to allow for 
growth and colour development 



Protective dietary components 



■(■Protective dietary components 
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White yeast colony containing 
original out-of-frame yeast//acZ 
fusion assay plasmid 




Frame-slippage within the poly d(GT/CA) 
tract gives rise to a functional in-frame variant 
of the yeast//acZ gene fusion. Faulty 
mismatch repair results in a white to blue 
transition 
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Hindlll.Sphl.P3tl.Sall.BamHI.Smal.Kpnl.Sacl.EcoRI 




EcoRV3921 



YEp356R (Myers et al. 1986). A yeast/ E. coli shuttle vector suitable for fusing yeast 
promoter and coding sequences to the lacZ gene of E. coll. The 2 micron sequences 
ensure a high copy number of 25-100 per cell. The Amp R and URA3 genes allow 
selection In E. coli and yeast respectively. 



Fig 2 
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BssHII.T3.Sacll.Notl.Xbal.Spel.BamHI.Smal.Pstl.EcoRI.EcoRV.Hindlll.Clal.Apal.Kpnl.T7. BssHII 




Pvul 3322 
Seal 3211 



Apal 1394 

Seal 1458 
Ncol 1563 
EcoRV1582 

Pstl 1790 



ARS1-14/CEN6 



FlS[» 3 pRS41S (Slkorekl, R. & Hleter, P. (1989). A low copy number vector containing 
^ sequences from yeast centromere VI. 
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Fl^» 5 PKa3- 9 (32-1) (High copy construct, see text for details) 



HindlllO 
BamHI22 

iEcoRI::(AC)18423 

PRS3Pro. 



Seal 6837 



Nsll 5267 




EcoRV1S82 



Ndel 3427 



Ndef 4608 
ECORV4339 

Nco,432 ° Seal 4215 



pKaCEN(32-1) (low copy construct) 

Construction of this low copy centromere-based plasmid involved 
removal of the PRSa^llacZ fusion containing the poly d(AC/TG)i 6 tract 



-from— pK-a3-9(32^1-)-on-a-3680-bp-ea-mHr/Ws/rfragment (cf. Fig. 5) 
Initially pKa3-9(32-1) was digested to completion with SamHI and Nsi\ 
in restriction buffer REact™3 (Materials & Methods). KCI was added to 
adjust the salt concentration to that of REact™6 and digestion with 
Seal was then carried out to eliminate incorporation of the smaller 
BamH\/Nsi\ fragment (Fig. ). The resulting 3680 bp fragment containing 
the PRS3 -'lacZ fusion was gel purified (cf. Materials & Methods). 
pRS416 was also digested to completion with Nsll and BamH\ in buffer 
REact™3 with the aim of removing th existing lacZ region on a 923 bp 
fragment. The resulting 3975 bp vector band containing the ARS and 
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pKalNT(32-1) (single copy integrative construct) 

Construction of pKalNT(32-1) involved removal of the PRS3'llacZ 
fusion containing the poly d(AC)i6 tract from pka3-9(32-1) on a 
BamHM Nsi\ fragment. This fragment was subsequently cloned into 
BamH\/Nsi\ digested integrative plasmid Ylp352 (cf. Fig. 4, Materials & 
Methods (Myers ef al. 1986)). Before pKalNT(32-1) could be integrated 
into the yeast genome it was linearised by restriction at its unique 
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pKalNT(32-1) (single copy integrative construct, see text for details) 
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1 pKalNT(32-1) 



URA3' A "P 



PRS3 ^/23 
ColEI Promoter , _ 
ORI k LacZ 



Ncol 



2 S.cerevisiaeYN94-1 



URA3 Locus 
Chromosome V 




URA3 Locus 
Chromosome V 



URA3 



ura3-1 



BamHI 



PRS3 

PROMOTER 




URA3' 




^^^S3Ncol 



LacZ 




ura3-1 



3680bp 



4 PRS3 Locus 

Chromosome VIII 
BamHI 

L_ 

I 



Nsil 



PRS3 

WILD ' TYPE ^ NE y 



2719 bp 



Scheme illustrating the integration of pKalNT(32-1) at the URA3 locus of S.cerevisiae YN94-1. 
pKalNT(32-1) is linearised with Nco\ (1) and integrated at the URA3 locus on chromosome V 
through the process of homologous recombination (2). One wild type URA3 and one mutant 
ura3-1 are therefore generated(3). Finally, two restriction fragments will be detected when 
BamHUNsli digested yeast DNA is probed with the PRS3 promoter region; firstly a fragment of 
3680 bp containing the integrated lacZ fusion and secondly a 2719 bp fragment from the wild- 
type PRS3. (3 & A). 

Fig. 8 
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linfTf 6 9e ' Sh ° Wing BamHl/Nsli ^ests of genomic DNA ofS. cerevisiae 
YN94-1.Lane (1) X BstEH marker, the sizes of the fragments 

?242 ' 6369, 5687, 4822 ' 4324 ' 3675 ' 2323 - 1929 1371 
vmqL' i i?ma 4 . & . 1 ! 7 bp res P ect,ve| y- Lane (2) BamHI/Afc/1 digested 
YN94-1 DNA control. Lanes (3-7) BamHVNsil digested DNA of URA + 
transformants. ^ ■ 



Fig. 9 



3 4 



6 




3680 
2719 



bp 

Corresponding Southern blot of the digests of genomic DNA shown in figure 9 
Lane (1)8amHI//Vs/l digested control YN94-1 DNA. Only one 2719 bp 
band » s seen corresponding to the wild type PRS3 gene on yeast chromosome 
okLt/^i! 5 e T H ' //VS/l dig6StS ° f genomic DNA containing integrated 
•h«p£c? . 9 T el WaS Pr ° bed Wi,h 3 423 b ? EcoRI fra 9™nt derived 
thePHS3 promoter. Two bands are clearly seen; one 2719 bp fraqment 

IZZTT" 9 10 ,he Wi ' d type PRS3 Qene on chromosome VII and a larger 
3680 bp fragment corresponding to the integrated vector at the URA3 locus 
on chromosome V. « _ 

Fig 10 

SUBSTITUTE SHEET (RULE 26) 



WO 98/45476 



PCT/GB98/00869 



11/23 




SUBSTITUTE SHEET (RULE 26) 



WO 98/45476 



PCT/GB98/00869 



12/23 



1 « I 

w is «c 

in ■ 

-a "5 ■£ 

o o " 
(/) O -o 

. *! C 
" o tS 

2 oj w 

R W3 © 

a <u S 

* a * 

O) 5 ci 

£c ? 

<« ^ 

o 

2 S ^ 

* 2 



3' 

O 

o w 
"o 

<D 
Q. 
(0 



c 



c 

0) 

O 



3. O 



S? « o 

£ i 2 

0> c/i r« 

«S a, o 
° £ t5 

*3 •£ DJ) .ft 



CD 



C3 u 
0i o 



CO. V) 



I— I 



CA) ^ C/3 



fa C 



A 



E 
2 

Of) 

c 

0) 



u 
2 

< 

>» 

O 
CL 



c 
a> 

D> 

N 
o 
jo 

a> 

to 

CO 
T3 

3 
o 

J2 
? 
re 

o 

(0 

m 




,• C 

(J ^ CO 

a o p 
° ^ 

v) o .2 
5°g 
.i 2i « 
^£ 

_ G W 

I- 



■a 



0J 



a 



^ CO 



g « S p 

P J) ^ p5 

« c; o 



O 



• o 

C 00 

o 

* 1 



5 



a> 
E 

I 

-C- 



u 
SS 

o 
< 

o 

CL 



SUBSTTTUTE SHEET (RULE 26) 



PCT/GB98/00869 



13/23 



• > 

CM 



4710 msh2::LEU2 

2510 wild type MS H 2 
1650 



Fig. 13A 



O 
ID 

5 £ 



2481 Amsh6::KanMX4 



SUBSTITUTE SHEET (RULE 26) 



WO 98/45476 



PCT/GB98/00869 



14/23 



pRHB113(msh2::LEU2) 

restricted with Spel 

Spel 



Spel 



|msh2 5'i 



LEU2 f 



YN94-1 

wild type (MSH2) 



Homologous recombination 



EcoRI 



Hindffl EcoRI HindHI EcoR[ 




Hindffl 
HindHI 



-1650 



2510 — 



EcoRI 
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1662 bp EcoRI probe 
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Fig 13B 

South rn blot analysis of YN97-10 (msh2::LEU2) 
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1^ 



M 

■ 5800 mlh1::LEU2 
■i 3000 



1000 
bp 



Fig. 14 

Southern blot analysis of YN97-1 1 (mlhl ::LEU2) 



Lane (1) £coRV digested control YN94-1 DNA. Two bends 
are seen (approximately 1000 and 3000 bp) corresponding 
to the wild type LEU2 gene on chromosome III. Lane (2) 
EcoRV digests of yeast genomic DNA containing disrupted 
MLH1. The gel was probed with a 2200 bp LEU2 fragment. 
Three bands are seen in the disruptant, two 
corresponding to the wild type LEU2 on chromosome III 
and the other larger band corresponding to the LEU2 gene 
used to disrupt MLH1 on chromosome XIII 
(approximately 5800 bp). 
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pSRC9(msh6::KanMX4) 
restricted with Sphl/BspEI 
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Homologous recombination 
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from Notl digested pFA6-KanMX4 



YN97-150 
(msh6::KanMX4) 



SphI Spel BamHI BspEI BamHI 



KanMX4 




— 2481 

Fig 15B 

South rn blot analysis of YN97-150 (msh6::KanMX4) 
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Fig 17A 
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Fig 16B 

Southern blot analysis of YN97-167 (rth1::KanMX4) 
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The effect of ethidium bromide on the stability of pKa3-9(32-1) 




01 23A567 89 



Ethidium bromide (ug/ml) 



Fig. 17 
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Fig 17B 

South rn blot analysis of YN98-3 (pol30-104::LEU2) 
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